When Automatic Voice Disguise Meets Automatic Speaker Verification
نویسندگان
چکیده
The technique of transforming voices in order to hide the real identity a speaker is called voice disguise, among which automatic disguise (AVD) by modifying spectral and temporal characteristics with miscellaneous algorithms are easily conducted softwares accessible public. AVD has posed great threat both human listening verification (ASV). In this paper, we have found that ASV not only victim but could be tool beat some simple types AVD. Firstly, three AVD, pitch scaling, vocal tract length normalization (VTLN) conversion (VC), introduced as representative methods. State-of-the-art methods subsequently utilized objectively evaluate impact on equal error rates (EER). Moreover, an approach restore disguised its original version proposed minimizing function scores w.r.t. restoration parameters. Experiments then from Voxceleb, dataset recorded real-world noisy scenario. results shown that, for obtains EER around 7% comparing 30% recently baseline using ratio fundamental frequencies. generalizes well nonlinear frequency warping VTLN reducing 34.3% 18.5%. However, it difficult source speakers VC our approach, where more complex forms functions or other paralinguistic cues might necessary transform VC. Finally, contrastive visualization features without illustrate role intuitive way.
منابع مشابه
Voice disguise and automatic detection
This study focuses on the question of voice disguise and the problem of its detection. The voice disguise is considered as a deliberated action of the speaker who wants to falsify or to conceal his identity. Lots of possibilities are offered to a speaker to change his voice and to false a human ear or an automatic system. He could transform his voice by electronic scrambling or more simply by e...
متن کاملA Novel Voice Activity Detection Approach for Automatic Speaker Verification
The main goal of this paper is to propose a novel voice activity detector (VAD) approach for Speaker Verification applications which is both faster and robust to noise environment. The idea is to use Support Vector Machine (SVM) parameters estimated using speech and nonspeech of a long utterance to compute a function decision of the proposed VAD approach. Speaker Verification results on TIMIT d...
متن کاملAutomatic versus human speaker verification: The case of voice mimicry
In this work, we compare the performance of three modern speaker verification systems and non-expert human listeners in the presence of voice mimicry. Our goal is to gain insights on how vulnerable speaker verification systems are to mimicry attack and compare it to the performance of human listeners. We study both traditional Gaussian mixture model-universal background model (GMM-UBM) and an i...
متن کاملEffect of voice disguise on the performance of a forensic automatic speaker recognition system
This paper presents first results of an ongoing study on the effects of common types of voice disguise, including increased voice pitch (even falsetto speech), lowered voice pitch and pinching the nose while speaking, on forensic speaker recognition (FSR) techniques. Natural and disguised speech data from 100 German speakers recorded 5 times over a period of 7 to 9 months were used in a series ...
متن کاملVoice modeling methods for automatic speaker recognition
Building a voice model means to capture the characteristics of a speaker’s voice in a data structure. This data structure is then used by a computer for further processing, such as comparison with other voices. Voice modeling is a vital step in the process of automatic speaker recognition that itself is the foundation of several applied technologies: (a) biometric authentication, (b) speech rec...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Information Forensics and Security
سال: 2021
ISSN: ['1556-6013', '1556-6021']
DOI: https://doi.org/10.1109/tifs.2020.3023818